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BACKGROUND OF THE INVENTION 



1. Field of the Invention 



[00011 The present invention relates to electronic devices, and in particular, to select 
signal circuits for low voltage swing circuits. 



2. Description of Related Art 

[00021 Low-voltage-swing (LVS) circuits have been developed as a means to decrease 
circuit area, increase throughput and consume less power than other circuit families. LVS 
circuits utilize large networks of pre-charged dual rail gates with low voltage outputs and 
sense amplifiers to restore the output to full rail signals, h. contrast to full-voltage swmg 
circuits, a low-voltage-swing logic circuit may provide valid data output signals based on 
a relatively small difference in potential between a pair of complementary data mput 
signals. More specifically, a first data wire is used to carry a true value of the valid data 
signal and a second data wire is used to carry a complementary value of the same valid 
data signal, hi a two-phase clocking arrangement, during a first clocking or pre-chargmg 
phase, both the data wires are pre-charged to a predetermined or pre-charge potential. At 
this point, the data wires do not contain any valid data or information. During a second 
clockmg or evaluating phase, the potentials on the two wires may diverge m response to 
the infomiation content of an input data signal, hi particular, one data wire "evaluates" by 
transitioning toward an evaluation potential and the second data wire remains at the pre- 
charge potential, the low-voltage-swing logic arrangement then provides a valid data 
output signal based on the potential or voltage difference between the valid true and 
complementary data mput signals. After the second clocking or evaluating phase, both 
data wires are again pre-charged to their pre-charge potential during a succeeding fnrst 
clocking or pre-charge phase. 

[0003] With respect to inputs to precharged-low LVS circuits, many circuit branches may 
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drive the same LVS node. In these topologies the select logic of the LVS circuit is such 
that during LVS evaluation phase, only one of these branches will drive the LVS node. 
Even though logically only one branch can be selected, it is possible that a branch selected 
during the previous cycle's evaluation phase is still turning OFF when the LVS circuit 
enters the current evaluation phase. This phenomenon can create potential differential in 
the direction opposite to the proper logical value. In such a case, this phenomenon slows 
down the operation of the LVS circuit. The reason for this is that the reverse differential 
(differential noise) must first be overcome before a differential can be built up in the 
correct direction. If a pulsed clock system is used to clock the LVS circuit and the 
contention period is significant, then the LVS circuit may not be able to overcome the 
reverse differential and a functional failure of an integrated circuit (e.g. a microprocessor) 
containing the LVS circuit may occur. 

[0004] Reverse differential build-up can be avoided by using precharged-low dynamic 
selects as an input to the LVS circuit. In this case the domino logic providing the dynamic 
selects needs to be in phase with the LVS circuit. However, a dynamic select is not a 
viable solution if the LVS circuit is distributed over a large distance, because in this case 
the LVS select maximum and minimum timing window is non-existent (negative) for 
sufficiently fast target frequencies. Furthermore, a dynamic signal distributed across a 
long distance is susceptible to pulse evaporation if the clock period is sufficiently short 
(high target frequency). Therefore, the use of static selects signals to enable a distributed 
LVS circuit is sometimes unavoidable. 

[0005] FIG. 1 illustrates the current state of the art for generating static LVS select 
signals for an illustrative LVS circuit 1 0, while minimizing or eliminating reverse 
differential build-up in the LVS circuit 10. A generic, master-slave flip-flop 12 drives the 
select signals with a minimum number of stages to the LVS circuit 10. The LVS circuit 
10 may be a distributed structure; therefore, the LVS select signal may have to span a long 
distance. The illustrative LVS circuit 10, in this case, is shown as a 3: 1 multiplexer 
circuit. The driving flip-flop 12 is triggered a phase ahead of the evaluation phase of the 
LVS circuit 10. In this example the select signals SA, SB, and SC are applied to the LVS 
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ei„„U ,0. These selec, si^s are mumaUy exclusive si^ls ^ .herefo. « on,y 

one of .he toee si^als wU. he HIGH during me evaluation phase of 

, , • ^wrinrKntrieeers the flip-flop 12 and the select 
The rising of *e flip-flop oloek s,^al (CLOCKl mggers . P P ^ 

signal that was HIGH in the previous cycle transtf ons fiom HIGH to LOW, jh 
L si^ for the current evaluation period transitions ...n IX>W to HIGH. Ensurutg 
that the ling-OFP edge of the previous select si^a. occurs before Ore LVS cucur 0 
hegins its next evaiu^on elintinates reverse differen^^ build-up in the LVS crcut. . M 
orL to speedup the data-dependant h^ing-OFF edge, the .u,rf>er of buffer stages 16 
between the fUp-flop 12 and LVS circuit 10 has been minimized to one. 

IW061 The prior art circuit arrangement of FIG. 1 illustrates the best-lorown method to 
Zate Jerse differential buUd-up, which is to speed up dre ^^-O^^^^-^^^ 
.dect signals such d>at dte LVS branch that is ON from the previous evaluatron cycle 
OFF before the next LVS evaluation phase begins. This is achieved by: (A) munmrzmg 
«,e number of buffer stages between the master-slave nip-flop 12 driving *e select 
signals and dre LVS ci,.uit 10; (B) skewing the buffer stages 16 between the nrp-fl P 
Ithe LVS circuit 10 such that Ure turning OFF edge is sp^ up atthe expe^e oHhe 
U^ing ON ^ge; and (C) speeding up the clo* signal to the fUp-flop 12 such that Ute 

n is triggered earUer h. order to speedup d>e arrival times of the select st^al s 
de-assertton and assertion edges at the gate inputs of the LVS ctcutt 10. 

,00071 Tire master-slave flip-flop 12 includes a master and a slave latch, with each 
Lin an open and a close state. The Utches pass through the data when in Ure open state 
andh!ldal)«.eda.awbeninthecloses.a.e.WhenthemasterIa.chinfl.ecloses.ate 

is holding dte data a^uited ftom the prevtous clock phase. Ure slave latch m the open 
state is passing the data being held by the master latch throu^ to the mp-flop s output 
When dre clock phase is reversed, the master latch swUches to its open sute to acqurre 
new data and at the same time *e slave latch switches .om .ts open state of „ 
nr^ter latch's output data to its close state otholding *e previous-provrded data. Hence, 
die master and slave latches operate "out-of-phase". 
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BRIEF DESCRIPTION OF THE DRAWINGS 

[00081 FIG. 1 shows a schematic of a prior art circuit arrangement for driving select 
signals to a LVS circuit. 

[00091 FIG. 2 illustrates a select logic circuit for a LVS circuit in accordance with one 
embodiment of the present invention. 

[00101 FIG. 3 is a schematic of one embodiment of the slave latch shown in FIG. 2. 

[00111 FIG. 4 is a schematic of one embodiment of the Dl-domino shown in FIG. 2. 

[00121 FIG. 5 is a timing diagram of a common clock signal generated by the clock 
generator shown in FIG. 2. 

[00131 FIG. 6 is a timing diagram of a first application of the select logic circuit of FIG. 
2 wherein a domino clock signal and a slave clock signal are the same. 

[00141 FIG. 7 is a timing diagram of a furst application of the select logic circuit of FIG. 
2 wherein the domino clock signal has a delay relative to the slave clock signal. 

[00151 FIG. 8 is a block diagram of a system including an integrated circuit having the 
select logic circuit and the LVS circuit shown in FIG. 2, in accordance with one 
embodiment. 
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DETAILED DESCRIPTION OF AN ILLUSTRATIVE EMBODIMENT 

[0016] In the following description, for purposes of explanation, numerous details are 
set forth in order to provide a thorough understanding of the disclosed embodiments of 
the present invention. However, it will be apparent to one skilled in the art that these 
specific details are not required in order to practice the disclosed embodiments of the 
present invention. In other instances, well-known electrical structures and circuits are 
shown in block diagram form in order not to obscure the disclosed embodiments of the 
present invention. 



[0017] With reference to FIG. 2, there is shown an illustrative low voltage swing (LVS) 
circuit 20 with a select logic circuit 22 in accordance to one embodiment of the present 
invention. Although many possible LVS circuits may be used with this select logic 
circuit 22, the illustrative LVS circuit 20 is a 3:1 multiplexer having three n-channel 
(NFET) transistors 24, 26, and 28 with their source terminals coupled to input signals 
LCS-INl, LCS-IN2, and LCS-IN3 respectively and with their drain tenninals coupled to 
a common node 30, so as to provide a single output signal on line 32. The transistors 
24, 26 and 28 have three input gate terminals 34, 36 and 38, respectively, which define 
three input temiinals for the LVS circuit 20. However, depending on the LVS circuit 
used with the select logic circuit 22, any number of input terminals is possible. As with 
all LVS circuits, the LVS circuit 20 utilizes pre-charged dual rail gates with low voltage 
outputs and sense amphfiers (not depicted) to restore the output to full rail signals. 

[0018] Although any number of inputs is possible, in the embodiment of the select logic 
circuit 22 of FIG. 2 there are three input signals: FF-INPUTl, FF-INPUT2, and FF- 
INPUT3 correlating with the three inputs 34, 36 and 38 of the LVS circuit 20. Hence, 
the select logic circuit 22 includes three circuit branches 40, 42, and 44. Each of the 
branches has the same components; hence, they are given the same reference numbers. 
Each branch 40, 42 and 44 includes a flip-flop 45 having in series a master latch 46, a Dl 
domino 48, a slave latch 50, and even number of buffering inverters 52 (at least one 
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pair). Each master latch 46 has an input to receive one of the input signals FF-INPUTl-S 
and each pair of buffering inverters 52 has an output coupled to one of the inputs 34, 36, 
38 of the LVS circuit 20. The output signals of the flip-flops 45 on are designated as 
select signals 'V. TTie Dl-domino 48 takes the form of a Dl-domino inverter and the 
slave latch 50 takes the form of a Set Dominant Latch (SDL) and. in particular, a SDL 
inverter. Although the Dl-domino is shown as an inverter, other Dl domino circuits 
may be used. An even number (at least two) of buffering inverters 52 are coupled 
between the slave latch 50 and the input gates of the LVS circuit 20. 

[00191 A clock generator 53 includes a clock source 54 and a delay buffer 55. Ma first 
implementation of the clock generator 53, the clock source 54 provides a first clock pulse 
signal (CLOCKl) to the three master latches 46 via a conductor 56 (also inferred to as 
the "master clock signal") and a second clock pulse signal (CL0CK2) to all three slave 
latches 50 via a conductor 58 (also referred to as a "slave clock signal"). The first and 
second clock pulse signals have the same frequency. The second clock signal is passed 
through a delay buffer 55 to provide a delayed second clock signal to the three Dl- 
dominos 48 (also referred to as the "dommo clock signal"). In a first appUcation of the 
select logic circuit 22, no delay is introduced between the slave and domino clock 
signals, i.e., no delay is introduced by the delay buffer 55 (therefore buffer 55 is not 
needed). In a second application of the select logic circuit 22, the second clock signal 
CL0CK2 is sped up (therefore, the slave clock is sped up) and a clock delay is 
introduced into the second clock signal by the delay buffer 55 so that the domino clock 
remains unchanged from the first application. THs delay allows for the slave latch 50 to 
be clocked earlier than the Dl-domino 48, which in turn allows for a higher target 
frequency as will be explained heremafter. These two applications of the select logic 
circuit 22 are described in detail with respect to FIGS. 6 and 7. The delay buffer 55 
includes an even number of inverters to delay the second clock signal. The use of the 
first and second clock signals CLOCKl and CL0CK2 provides the most freedom in 
clock timing. However, as described below, only a single clock signal may be used. 
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[00201 In a second implementation of the clock generator 53, a single, common clock 
signal is generated by the clock generator 53. This common clock signal is directly 
comiected to the slave latches 50 and, after a delay introduced by the delay buffer 55 
(again, an even number of inverters), is commonly comiected to the master latches 46 
(not depicted in FIG. 2) and the Dl-dominos 48. With respect to the second described 
application of the select logic circuit 22 (buffer delay), this clock generator 
implementation allows for the option of resetting the slave latches 50 earlier with respect 
to the master latches 46 and the Dl-dominos 48, but with a single clock signal. In other 
words, the common clock signal becomes the slave clock signal and thereafter the buffer 
delay of the delay buffer 55 is introduced to generate the master clock signal and the 
domino clock signal. With respect to the above-described first application of the select 
logic circuit 22 (no buffer delay), an extension of this second implementation of the 
clock generator 53 is to have all three clock signals (master, domino, and slave) consist 
of the same common clock signal (same frequency and phase shift). In all the variations 
described above, all the clock signals have the same frequency. 

10021] Figure 3 illustrates the slave latch 50 ofFIG. 2 in detail. A leg of transistors 
includes a first p-chamiel transistor PI with a source terminal coupled to the supply 
voltage VccT and a drain terminal coupled to a node 70, a first n-channel transition Nl 
with a source terminal coupled to the node 70 and a drain tenninal coupled to a node 72, 
and a second n-chamiel transistor N2 with a source terminal coupled to the node 72 and a 
drain terminal coupled to ground. A second p-chamiel transistor P2 is coupled between 
the supply voltage and node 70 and a third n-chamiel transistor N3 is coupled between a 
node 74 and ground. An inverter 76 is coupled between node 70 and node 74 and node 
74 is also comiected to the gate of the second transistor P2. The input data (which is the 
output signal of the Dl-domino 48) is provided over line 68 to gates of transistors PI and 
Nl . The second clock signal (slave clock signal) is a reset signal for the slave latch 50 
and is coupled to a gate of the transistor N2. The output signal of the slave latch 50 at 
node 70 is provided over the line 64. 
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[0022] Figure 4 illustrates the Dl -domino 48 of FIG. 2 in detail. Each of the Dl- 
dominos 48 is known in the art as a "Dl domino" and use domino logic. The Dl -domino 
48 consists of two main components: a pre-charge p-type field-effect device 80 having its 
gate coupled to the domino clock (delayed second clock signal), and a conditional pull- 
down n-type device stack (the inverter logic) 82 that evaluates the latch output signal of 
the master latch 46 provided via a line 66. The p-type pull-up device 80 is coupled 
between the positive supply, Vcc and an output node 84, with the node 84 also being 
coupled to the input of the slave latch 50 via a line 68. The n-chaimel device stack 82 is 
coupled in series with clocked n-type transistor 86 between node 84 and ground. As 
mentioned above, the device stack 82, which in this case is inverter logic (n-type field- 
effect devices not shown), is utilized to evaluate the input logic signal via line 66. 
Operation of the traditional Dl -domino 48 is relatively simple. When the domino signal 
coupled to the gates of the devices 80 and 86 is low, the circuitry is in a pre-charge 
phase. Domino output node 84 is pre-charged to a high potential during this pre-charge 
phase. The signal level of input to the n-chaimel device stack 82 is unimportant during 
the pre-charge phase, since the n-type devices of the inverter logic 82 coupled between 
the output node 84 and ground are nonconducting, i.e., "off. When the domino clock 
signal transitions high, the domino output node 84 is no longer actively driven by the p- 
type field-effect pull-up device 80. In the case, where the input satisfies certain logic 
conditions of the inverter logic 82, the output node 84 is discharged. Thus, when the 
domino clock signal is high, the domino circuit 48 is in an evaluation phase and when the 
clock is low, the Dl -domino 48 is in its pre-charge phase. 

[0023] Referring back to FIG. 2, each of the select signals SA, SB, SC is defined to have 
a "tuming-ON edge" (asserting edge) and a ''tuming-OFF edge" (de-asserting edge). In 
the illustrative embodiment of FIG. 2, the tuming-ON edge is a rising signal transition 
fi:om low-to-high and the tuming-OFF edge is a falling signal transition firom high-to- 
low. When the select signals are high they are "ON" and when low they are "OFF". 
The tuming-ON and tuming-OFF edges of the select signals are described as having 
"arrival times" at the gates 24, 26, and 28 of the LVS circuit 20. The select logic circuit 
22 is designed to speed up the arrival of the tuming-OFF edge of a given select signal to 
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substantially eliminate any possible contention with the tuming-ON edge of the next 
select signal and therefore to substantially eliminate any reverse differential build-up. In 
other words, the select logic circuit 22 is designed to increase the difference in the arrival 
time of the tuming-OFF edge of the given select signal and the arrival time of the turn- 
ON edge for the next select signal. This also may be characterized as reducing the 
difference in arrival times of the tuming-ON and tuming-OFF edges of the given select 
signal. The tuming-ON edge and tuming-OFF edge characterizations of select signals 
SA, SB, SC also describes the select signals "x". 

[0024] As explained above, the embodiment of the present invention is designed to 
substantially eliminate reverse differential build-up by speeding up the tuming-OFF edge 
of the select signals, such that the LVS branch 40, 42, or 44 which is ON from the 
previous LVS evaluation phase (cycle) is OFF before the next LVS evaluation phase 
begins. One factor in achieving the desired substantial eUmination of reverse 
differential build-up is that the select logic circuit 22 enables the tuming-OFF edge of the 
select signals to be generated independently of the input data. The operation of the select 
logic circuit 22 is described hereinafter to show how the tuming-OFF edge of the select 
signals is generated independently of the input data. To accomplish this, the prior art 
slave latch of FIG. 1 is replaced in FIG. 2 with the Dl -domino 48 followed by the slave 
latch 50. In other word, the Dl -Domino 48 and the slave latch 50 in combination 
undertake the slave operation with respect to the master latch 46. 

[0025] Referring to FIGS. 2 and 5, to assist in understanding the master-slave operation 
of the flip-flops 45, the circuits are defined as to whether they operate "in-phase" or "out- 
of phase". In FIG. 2, the Dl-domino 48 and the slave latch 50 operate in-phase with each 
other (with or without a delay or phase shift created by the delay buffer 55), but are out- 
of-phase with the preceding master latch 46. The master latch 46 and the slave latch 50 
each have an open state and a close state. The latches 46 and 50 pass-through the data 
when open and hold (latch) the data when in the closed. The in-phase and out-of-phase 
relationships of the circuits of FIG. 2 are described in more detail with respect to an 
illustrative common clock signal 90 in FIG. 5. When the clock state is LOW, the master 



10 



Attorney Docket No. 1 10348-134107 Express Mail No. EL973636910US 

Intel Tracking No. PI 7320 

latch 46 is in its open state to acquire new data. Also, the Dl -domino 48 is in its pre- 
charge phase (not receiving data from the master latch 46). The slave latch 50 is in its 
close state (not receiving data from the Dl -domino 48), where it holds the previously- 
provided data from the DlTdomino 48 and provides it as an output of the flip-flop 45 via 
the line 64 as select signals x. As designated by arrow 92, when the clock signal 
transitions from LOW to HIGH (the clock state is reversed), the flip-flop 45 is triggered. 
The master latch 46 switches from its open state to its closed state where it holds the new 
data it acquired in the previous clock phase when it was in its open state. Additionally, 
with the clock signal HIGH, the Dl -domino 48 is in its evaluation phase and the slave 
latch 50 is reset and in its evaluation phase (its open state). During this evaluation phase, 
the Dl -domino 48 and slave latch 50 acquire the output data of the master latch 46 and 
provide it as an output for the flip-flop 45. Hence, the Dl domino 48 and the slave latch 
50 operate "in-phase" (with or without the delay of created by the delay buffer 55), 
whereas the master latch 46 and the slave latch 50 operate "out-of-phase". The exact 
timing of the master latch 46 is irrelevant. All that is needed to be known is that in the 
flip-flop 45, the master latch 46 and the slave latch 50 are "out-of-phase" and that the Dl- 
domino must be "in-phase" with the slave latch 50 for correct functionaUty. 

[0026] With respect to FIGS 2 and 5, the low-to-high transitions of the clock signal 90 
provides timing for the select signals SA, SB, and SC in that these transitions trigger the 
tuming-OFF edges of the select signals. The LVS circuit 20 begins evaluation at the 
arrow 96 and the sense amplifiers (not depicted) of the LVS circuit 20 are enabled at the 
arrow 98 to sense the voltage differentials. Again, as with every low-to-high transition, 
the flip-flop 45 is triggered at the arrow 98. The previously mentioned first and second 
applications of the select logic circuit 22 now will be described with respect to FIGS. 6 
and 7 respectively. In both applications, the tuming-OFF edge of the select signals is 
generated independently of the input data, but instead are triggered by the slave clock 
resetting the slaves latches 50 in manner to be described hereinafter. 

[0027] Referring to FIGS. 2, in the first appHcation of the select logic circuit 22, there is 
no buffer delay between the domino and slave clocks. In other words, the delay buffer 
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55 of FIG. 2 between Dl-domino 48 and slave latch 50 is set to zero. The second clock 
signal CL0CK2 goes to both the slave latches 50 and the Dl-dominos 48 without delay 
(i.e., a common clock). In this first application, the domino evaluation phase of the Dl- 
domino 48 and the reset of the slave latch 50 are simultaneously triggered by the rising 
edge of the second clock signal (CL0CK2), which becomes a common clock signal for 
the two components. Within one gate delay of the rise of the clock signal CL0CK2, the 
output of the slave latch 50 falls and begins the tuming-OFF edge transition of the ON 
select signal, because the Dl-domino 48 has only just begun to evaluate and therefore the 
Dl-domino 48 is still in the pre-charge phase with its output on Une 68 still being in its 
pre-charge state. The tuming-ON edge of the next select signal is generated at the output 
of the slave latch 50 two gate delays after the rise of the second clock signal CL0CK2, 
because it is the result of the domino evaluation traversing through the slave latch 50. As 
mentioned before, it requires only one gate delay to generate the tuming-OFF edge of the 
previous select signal. 

[0028] Referring to FIGS. 2 and 6, the reset of the slave latch 50, and therefore the 
generation of the tuming-OFF edge of the select signal, is independent of the data in the 
master latch 46, even with the delay of the buffer 55 being set to zero. This is best 
shown in FIG. 6, where there is illustrated a master clock waveform 100, a domino clock 
waveform 102, and a slave clock waveform 104. In this case, all three clock signals are 
the same, but the important aspect is that the dommo clock waveform 102 and slave 
clock waveform 106 are the same (no clock delay introduced by the delay buffer 55). 
The master clock and the domino clock are fixed in timing. It is only the clock to the 
slave latch 50 that is variable. In other words, CL0CK2 is variable but the domino clock 
is not. If CL0CK2 is varied, then the timing of the buffer 55 must be set such that the 
domino clock timing has not changed, as occurs in the second application discussed 
below. In this illustrative case of the first application, CLOCKl is equal to CL0CK2. 

[00291 The different phases of the components described in FIG. 5 are also shown in 
FIG. 6. More specifically, with the master clock waveform 100, the master latch 46 is 
shown with the MASTER LATCH OPEN state and MASTER LATCH CLOSE state. 
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With the domino clock waveform 102, the Dl -domino 48 is shown with DOMINO 
PRECHARGE phase and DOMINO EVALUATION phase. With the slave clock 
wavefomi 104, the slave latch 50 is shown with the SLAVE LATCH CLOSE state and 
SLAVE LATCH OPEN state. When the domino clock signal 102 is LOW, then the Dl- 
dominos 48 are in their pre-charge phase and therefore the inputs of the slave latches 50 
are preset HIGH. When the slave clock 104 rises, the slave latches 50 begin to pull- 
down (SLAVE RESET) because the Dl -dominos 48 are HIGH. If a given Dl -domino 
48 remains high in its evaluation phase, then the associated slave latch 50 receives a one 
inversion head start in generating a falling transition, because the latch 50 does not wait 
to look at the evaluation value of the Dl -domino 48 to transition LOW. In this case, the 
speculative tum-OFF edge for the select signal is the correct guess by the slave latch 50. 
Even if the Dl -domino evaluates to LOW during its evaluation phase, the slave latch 50 
speculatively starts to transition LOW because when the slave clock signal rises, the Dl- 
domino output is still HIGH and therefore the slave latch 50 begins to transition LOW. 
In this case, the speculative tum-off by the slave latch 50 is a wrong guess. However at 
some point though, the Dl -domino 48 transitions low enough to be able to pull-up the 
slave latch 50 and correct it. This creates a gUtch on the slave latch's output, i.e., the 
slave latch 50 begins to transition low but then it pulled back up. 

[0030] Referring to FIG. 2, the LVS select signals on line 64, prior to passing through 
the inverters 46, are referred to as LVS select signals "x". Referring back to FIG. 6, the 
tuming-OFF edge of a given select signal x is shovm by part of its waveform 106. As 
shown by an arrow 108, this tuming-OFF edge is de-asserted one inversion after the rise 
of the domino/slave clock signals. The tuming-ON edge of the select signal x is shown 
by part of its waveform 110. As shown by an arrow 112, this tuming-ON edge is 
asserted two inversions after the rise of the domino/slave clock signals. 

[0031] Setting the clock delay of the delay buffer 55 between the Dl -domino 48 and 
slave latch 50 to zero is likely to yield the slowest tuming-OFF edge arrival time of the 
select signal at the gates of the LVS circuit 20. Even in this first application, the select 
logic circuit 22 is able to provide a tuming-OFF edge at the gates of the LVS circuit 20 
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r .u .„rr<>tit Qtate of the art illustrated in FIG. 1 • This is 
that matches the delay of the current state 01 me ar ,,,,usoand 

. u ff^rina inverter 52 between the slave latch 50 ana 
achieved while providing an extra buffenng mverter 

the LVS circuit 20 for better noise rejection and drive capabrhty. As mentioned abo 
"s^Xoftheselectlogiccircuit22maybeobtainedwiththedomino.^ 

rrblg the same common cloc. si^al or with the master, domino, and slave cloc. 

signals being the same common clock signal. 

Referring to FIGS. 2 and 7, the second application of the select logic circuit 22 is 

! . . •, Themaster domino and slave clock signals are shown by waveforms 
described m detail. The master, domin , ^^uv is introduced between 

114 116 and 118 respectively. In this second application, a delay is mtroduc 
1 14, 1 lo, diiu i X 1 1 1 o Tti this illustrative example, 

. , • „i 11 nf FIG 7 remams the same, but the slave 
.,atWe » FIG. 6. d.e domino clock .gnal 116 of FIG. 7r 

cock si^al 118 is adjusted «. bo 2 invers,o„s earUe,. Tins .s a«ompl y 
,„„i„oclocksisnalll6.ein.derivedb,buffe.n,*eslavec.„ok.^^U^^ 

„ in U,e delay buffer 55. As in FIG. 6. *e master clock s,^al and 

. , v=i<.alll6ofnG7arefixed.buttheslaveelocksignalll8isvanable. 

r :::cX sped up. b. .e dom.0 c..k remains .cb^scd ^ue . 
rZS^uffer 55. More spec.acally. die clock generalor 5, ,e„».es .e^ve 
ls;al..ac.oekad— .^^^^^^ 

^ Slave latch 50 wid«.ut worsening «,e time constran..s on the FF mPU 

■ ■ .u ,„™ ON edee Also as in FIG. 6, the phases/stages of the master 
compromismg the mm-ON edge. Also . i h«t will not be 

latches 46. Dl-domi„os 48, and slave latches 50 are shown m FIG. 7, bntwll 

described again. 

,0„3Z,,noperaaon,atthemoment.es.avecl.kst.al^^^^ 

:::r— Jowrrof.eda.v.ueona,.c66.om.el,..^^^ 
4rr.ausethedomtnooutputonaline68,emainsatitspre<hargephase value. The 
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LVS select signal x is de-asserts its tuming-OFF edge one inversion before the rise of the 
domino clock signal 1 16. This is illustrated by the tuming-OFF edge of the select signal 
X shown by a segment of its waveform 122 and an arrow 124 between the rising slave 
clock signal 118 ant the tuming-OFF edge of the LVS select signal wavefonn segment 
122. The tuming-ON edge of the select signal x is illustrated by its waveform segment 
126. As^shown by an arrow 128, the LVS select signal x asserts its tuming-ON edge two 
inversions after the rise of the domino clock signal 1 16. hi comparison with FIG. 6, this 
tuming-ON edge in FIG. 7 remains unchanged. On the other hand, the tuming-OFF edge 
of FIG. 7 is advanced in time (2 inversion speedup) relative to the tuming-OFF edge of 
FIG. 6. 

[00331 The select signal x on line 64 is matched in polarity with the final LVS select 
signal because there is an even number of buffering inverters 52 between the slave latch 
50 and the LVS circuit 20. Hence, the data independent HIGH to LOW transition of the 
slave latch 50 enables the FALL transition of the tuming-OFF edge of the LVS select 
signal to occur earlier than their occurrences in the prior art design described in FIG. 1 . 

[00341 hi summary, the select logic circuit 22 enables the tuming-OFF edge to be 
generated independently of the input data, histead, the slave clock signal resetting the 
slave latch 50 triggers the tuming-OFF edge. As described above, using the clock signal 
for triggering the tuming-OFF edge is speculative - it may or may not be correct. In 
other words, with the select logic circuit 22, the tuming-OFF edge is generated without 
waiting for the data and thereafter the output of the slave latch 50 may or may not be 
turned on depending upon the data. Referring to FIG. 2, by speeding up the second clock 
signal CL0CK2 relative to the first clock signal CLOCKl and adjusting the second clock 
(CL0CK2) that resets the slave latch 50 relative to the timing of the domino clock by 
djusting the delay buffer 55, it is possible to significantly speed-up the tuming-OFF 
dge of the previously ON select signal without further constraining the data inputs. 



a 
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[00351 There are several advantages of the embodiment of the select logic circuit 22, as 
will be described with respect to FIGS. 1 and 2. First, the select logic circuit allows for a 
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faster target frequency because it may produce a faster select tuming-OFF edge. Second, 
the select logic circuit 22 may speed up the dehvery of the select tuming-OFF edge 
without placing additional constraints on the inputs to the master latch 26. To the 
contrary, in the prior art design of FIG. 1, if the clock to the flip-flop 12 is sped up by n 
amount of seconds then the constraints on the flip-flop will also get more stringent by n 
pS. Third, the select logic circuit 22 enables the use of more buffering inverters 52 
between the master latch 46 and the LVS circuit 20. Since the tuming-OFF edge can be 
sped-up significantly by the select logic circuit 22, more buffer stages may be used to 
drive the select signal to the LVS circuit 20. The prior art design of FIG. 1 relies on 
minimizing the number of buffer stages between the driver and receiver to speedup the 
turning OFF edge of the select signals and therefore compromises the ability of the select 
branch to reject noise that can enter the LVS circuit 12. The robustness, and especially 
speed, of LVS circuits is very sensitive to noise on its controls (gates) and therefore the 
select logic circuit 22 allows for a faster and more robust LVS circuit. Fourth, the select 
logic circuit 22 does not rely on skewing of the buffering inverters 52 as occurs in FIG. 1 
and therefore it is a more robust solution (better edge-rate, better noise rejection). 

[0036] There are two aspects to the second advantage described above. First, the select 
logic circuit 22 permits the speeding up of the select signal's tuming-OFF edge without 
requiring that the FF-INPUT signal (see FIG. 2) come earlier. This is another reason to 
keep the master clock signal fixed for all cases, e.g., the two cases illustrated in FIGS. 6 
and 7. In fact, the master clock signal remains the same for both the prior art select logic 
circuit 10 of FIG. 1 and the select logic circuit 22 of FIG. 2. Second, the select logic 
circuit 22 avoids jeopardizing the timing of tuming-ON edge of the select signals when 
the tuming-OFF edge of the select circuit is sped up. With the master clock signal 
already fixed, it is desirable to keep the domino clock signal fixed between the first and 
second application of the select circuit 22 because moving this clock signal can violate 
the two aspects described above. Therefore, preferably the only clock with variable 
timing is the slave clock signal. 
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[0037] With respect to FIG: 7, it is not possible to keep "pulling in" the slave clock 
signal (shifting it to the left). The amount that the slave clock signal can be "pulled in" 
(i.e., delayed) is determined by the time it takes to pre-charge the Dl -domino 48 after the 
domino clock signal falls. 

[0038] Referring to FIG. 5, there is illustrated one of many possible systems 130 in 
which the select logic circuit 22 and LVS circuit 20 may be used. The two circuits 20 
and 22 are shown implemented in an integrated circuit (IC) 132. In one embodiment, the 
IC 132 is a microprocessor. In an alternate embodiment, the IC 132 may be an 
application specific IC (ASIC). The circuits 20 and 22 are included in a central 
processing imit 134 of the IC 132 and, more specifically, in an arithmetic and logic unit 
(ALU) and address generation unit (AGU). The LVS circuit 20, with the select logic 
circuit 22, is used as a front-end LVS multiplexer for the ALU and the AGU to 
substantially guarantee that the distributed static LVS selects do not create reverse 
differential build-up at the onset of LVS evaluation, especially for very fast target 
firequencies (7+ GHz integer executions). 

[0039] For the embodiment, the system 130 also includes a main memory 136, a 
graphics processor 138, amass storage device 140 and an input/output module 142 
coupled to each other by way of a bus 144, as shovra. Examples of the memory 146 
include but are not limited to static random access memory (SRAM) and dynamic 
random access memory (DRAM). Examples of the mass storage device 140 include but 
are not limited to a hard disk drive, a compact disk drive (CD), a digital versatile disk 
driver (DVD), and so forth. Examples of the input/output modules 142 include but are 
not limited to a keyboard, cursor control devices, a display, a network interface, and so 
forth. Examples of the bus 144 include but are not limited to a peripheral control 
interface (PCI) bus, and Industry Standard Architecture (ISA) bus, and so forth. In 
various embodiments, system 140 may be a wireless mobile phone, a personal digital 
assistant, a pocket PC, a tablet PC, a notebook PC, a desktop computer, a set-top box, an 
entertainment unit, a DVD player, and a server. 
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[0040] Although specific embodiments have been illustrated and described herein, it 
will be appreciated by those of ordinary skill in the art that any arrangement which is 
calculated to achieve the same purpose may be substituted for the specific embodiment 
shown. This application is intended to cover any adaptations or variations of the present 
invention. Therefore, it is manifestly intended that this invention be limited only by the 
claims and the equivalents thereof 
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